Informative Labeling Schemes for the Least Common Ancestor Problem

نویسندگان

  • Saverio Caminiti
  • Irene Finocchi
  • Rossella Petreschi
چکیده

We address the problem of labeling the nodes of a tree such that one can determine the identifier of the least common ancestor of any two nodes by looking only at their labels. This problem has application in routing and in distributed computing in peer-to-peer networks. A labeling scheme using Θ(log n)-bit labels has been presented by Peleg. By engineering this scheme and a new one due to the authors, we obtain a variety of data structures with the same asymptotic performances. We conduct a thorough experimental evaluation of all these data structures. Our results clearly show which variants achieve the best performances in terms of space usage, construction time, and query time. Effective representations of large, geographically dispersed communication networks should allow the users to efficiently retrieve information about the network in a distributed and localized way. Labeling schemes provide an answer to this problem by assigning labels to the network nodes in such a way that queries can be computed alone from the labels of the involved nodes, without any extra information source. The primary goal of a labeling scheme is to minimize the maximum label length, while keeping queries fast. Adjacency labeling schemes were first introduced by Breuer and Folkman in [5, 6], and further studied in [11]. The interest in informative labeling schemes, however, was revived only more recently, after Peleg showed the feasibility of the design of efficient labeling schemes capturing distance information [15]. Since then, upper and lower bounds for labeling schemes have been proved on a variety of graph families (including weighted trees, bounded arboricity graphs, intersection-based and cdecomposable graphs) and for a large variety of queries, including distance [2, 8, 10], tree ancestry [1, 3], flow and connectivity [13]. In spite of a large body of theoretical works, to the best of our knowledge only few experimental investigations of the efficiency of informative labeling schemes have been addressed in the literature [8, 12]. In our work [7] we focus on labeling schemes for answering least common ancestor queries in trees. Labeling schemes for least common ancestors are mainly useful in routing messages on tree networks: the ability to compute the identifier of the least common ancestor of any two nodes u and v turns out to be useful when a message has to be sent from u to v in the network, because the message has to go through lca(u, v). Other applications are related to query processing in XML search engines and distributed computing in peer-to-peer networks (see, e.g., [3, 4, 12]). In [16], Peleg has proved that for the class of n-node trees there exists a labeling scheme for least common ancestors using Θ(log n)-bit labels, which is also shown to be asymptotically optimal. We finally remark that, when node levels are known, it is trivial to compute the distance between any two nodes given their lca. Therefore, all the data structures considered in this work can be easily exploited to answer distance queries.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Informative Labeling Schemes for Graphs

This paper introduces the notion of informative labeling schemes for arbitrary graphs. Let f (W) be a function on subsets of vertices W . An f labeling scheme labels the vertices of a weighted graph G in such a way that f (W) can be inferred (or at least approximated) efficiently for any vertex subset W of G by merely inspecting the labels of the vertices of W, without having to use any additio...

متن کامل

Engineering Tree Labeling Schemes: A Case Study on Least Common Ancestors

We address the problem of labeling the nodes of a tree such that one can determine the identifier of the least common ancestor of any two nodes by looking only at their labels. This problem has application in routing and in distributed computing in peer-to-peer networks. A labeling scheme using Θ(log n)-bit labels has been previously presented by Peleg. By engineering this scheme, we obtain a v...

متن کامل

Near-optimal labeling schemes for nearest common ancestors

We consider NCA labeling schemes: given a rooted tree T , label the nodes of T with binary strings such that, given the labels of any two nodes, one can determine, by looking only at the labels, the label of their nearest common ancestor. For trees with n nodes we present upper and lower bounds establishing that labels of size (2± ǫ) log n, ǫ < 1 are both sufficient and necessary. Alstrup, Bill...

متن کامل

Nearest Common Ancestors: Universal Trees and Improved Labeling Schemes

We investigate the nearest common ancestor (NCA) function in rooted trees. As the main conceptual contribution, the paper introduces universal trees for the NCA function: For a given family of rooted trees, an NCA-universal tree S is a rooted tree such that any tree T of the family can be embedded into S such that the embedding of the NCA in T of two nodes of T is equal to the NCA in S of the e...

متن کامل

Labeling Schemes for Nearest Common Ancestors through Minor-Universal Trees

Preprocessing a tree for finding the nearest common ancestor of two nodes is a basic tool with multiple applications. Quite a few linear-space constant-time solutions are known and the problem seems to be well-understood. This is however not so clear if we want to design a labeling scheme. In this model, the structure should be distributed: every node receives a distinct binary string, called i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009